3574 results found.
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
CreativeCommons
Size:
8.6 MByte Production Status:
Newly created-finished
Use:
Textual Entailment and Paraphrasing
-
Paper title:Automatic Compilation of Resources for Academic Writing and Evaluating with Informal Word Identification and Paraphrasing System
-
Paper track:Evaluation/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Seid Muhie Yimam | Resources for Academic Writing | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Bilingual
Languages:
Egyptian Arabic English
Availability:
From Owner
License:
OpenSource
Size:
505 KByte Production Status:
Newly created-finished
Use:
Morphological Analysis
-
Paper title:Cairo Student Code-Switch (CSCS) Corpus: An Annotated Egyptian Arabic-English Corpus
-
Paper track:Speech/poster presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Mohamed Balabel | Cairo Student Code-Switch Corpus | /N |
Documentation:
not yet available
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
pending
Size:
1122 sentences Production Status:
Newly created-in progress
Use:
Natural Language Understanding for Human-Robot Interaction
-
Paper title:Dialogue-AMR: Abstract Meaning Representation for Dialogue
-
Paper track:Multimodality/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Claire Bonial | Dial-AMR Corpus | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
Size:
None Production Status:
Newly created-finished
Use:
Question Answering
-
Paper title:ScholarlyRead: A New Dataset for Scientific Article Reading Comprehension
-
Paper track:Written/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Tanik Saikh | Machine Reading Comprehension Dataset | /N |
Documentation:
No
Written
Evaluation Data,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
68.7 MByte Production Status:
Existing-used
Use:
Question Answering
-
Paper title:Contextualized Embeddings based Transformer Encoder for Sentence Similarity Modeling in Answer Selection Task
-
Paper track:Evaluation/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Md Tahmid Rahman Laskar | CETE codes and datasets. | /N |
Documentation:
Check the readme.txt file given in the shared resources. It is written in English.
Written
Corpus,
Language Type:
Bilingual
Languages:
English Late Modern English
Availability:
Freely Available
License:
CreativeCommons
Size:
17520 texts OtherProduction Status:
Existing-updated
Use:
Corpus Creation/Annotation
-
Paper title:The Royal Society Corpus 6.0: Providing 300+ Years of Scientific Writing for Humanistic Study
-
Paper track:Written/poster presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Stefan Fischer | Royal Society Corpus 6.0 Open | /N |
Documentation:
Accompanying website and several peer-reviewed publications, all written in English.
Written
Corpus,
Language Type:
Multilingual
Languages:
Chinese English French German Japanese Korean Russian Spanish
Availability:
Freely Available
License:
CC-BY-4
Size:
68000000 sentences Production Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:ParaPat: The Multi-Million Sentences Parallel Corpus of Patents Abstracts
-
Paper track:Written/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Felipe Soares | ParaPat | /N |
Documentation:
None
Not Applicable
Corpus,
Language Type:
Bilingual
Languages:
English Russian
Availability:
Freely Available
License:
Size:
2.3 MByte Production Status:
Newly created-finished
Use:
Text Mining
-
Paper title:Detecting Troll Tweets in a Bilingual Corpus
-
Paper track:Evaluation/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Lin Miao | Bilingual troll tweets | /N |
Documentation:
None
Written
Corpus,
Language Type:
Multilingual
Languages:
English French German Italian Spanish
Availability:
Freely Available
License:
Creative Commons
Size:
59364453 sentences Production Status:
Newly created-finished
Use:
Word Sense Disambiguation
-
Paper title:Sense-Annotated Corpora for Word Sense Disambiguation in Multiple Languages and Domains
-
Paper track:Written/poster presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Bianca Scarlini | OneSeC | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
CreativeCommons BY-NC-SA 4.0
Size:
1.3 billion tokens Production Status:
Newly created-in progress
Use:
Corpus Creation/Annotation
-
Paper title:Collecting Tweets to Investigate Regional Variation in Canadian English
-
Paper track:Written/poster presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Filip Miletic | CanEn: A corpus of tweets in Canadian English | /N |
Documentation:
None




